
    Can Who-Edits-What Predict Edit Survival?

    As the number of contributors to online peer-production systems grows, it becomes increasingly important to predict whether the edits that users make will eventually be beneficial to the project. Existing solutions either rely on a user reputation system or consist of a highly specialized predictor that is tailored to a specific peer-production system. In this work, we explore a different point in the solution space that goes beyond user reputation but does not involve any content-based feature of the edits. We view each edit as a game between the editor and the component of the project. We posit that the probability that an edit is accepted is a function of the editor's skill, of the difficulty of editing the component, and of a user-component interaction term. Our model is broadly applicable, as it only requires observing data about who makes an edit, what the edit affects, and whether the edit survives or not. We apply our model to Wikipedia and the Linux kernel, two examples of large-scale peer-production systems, and we seek to understand whether it can effectively predict edit survival: in both cases, we provide a positive answer. Our approach significantly outperforms those based solely on user reputation and bridges the gap with specialized predictors that use content-based features. It is simple to implement, computationally inexpensive, and in addition it enables us to discover interesting structure in the data. Comment: Accepted at KDD 201
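
    The abstract above says only that the acceptance probability is "a function of" editor skill, component difficulty, and a user-component interaction term. As a minimal sketch, assuming a logistic link and a dot product of latent vectors for the interaction (both are illustrative assumptions, not taken from the abstract), the idea could look like this. The function name, its parameters, and the `bias` term are hypothetical.

```python
import math


def survival_probability(skill, difficulty, user_vec, comp_vec, bias=0.0):
    """Hypothetical edit-survival probability.

    Assumptions (not stated in the abstract): the three terms are combined
    additively, the interaction is a dot product of latent vectors, and the
    result is passed through a logistic function.
    """
    if len(user_vec) != len(comp_vec):
        raise ValueError("latent vectors must have the same length")
    # User-component interaction term, assumed to be a dot product.
    interaction = sum(u * c for u, c in zip(user_vec, comp_vec))
    # Higher skill raises the logit; higher difficulty lowers it.
    logit = skill - difficulty + interaction + bias
    return 1.0 / (1.0 + math.exp(-logit))


if __name__ == "__main__":
    # A skilled editor on an easy component versus a hard one.
    print(survival_probability(1.0, 0.2, [0.5, -0.1], [0.4, 0.3]))
    print(survival_probability(1.0, 2.0, [0.5, -0.1], [0.4, 0.3]))
```

    With this form, only the editor identity, the component identity, and the survival outcome are needed to fit the parameters, which matches the data requirements the abstract describes.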

    On Discrimination Discovery and Removal in Ranked Data using Causal Graph

    Predictive models learned from historical data are widely used to help companies and organizations make decisions. However, they may treat certain groups unfairly, raising concerns about fairness and discrimination. In this paper, we study the fairness-aware ranking problem, which aims to discover discrimination in ranked datasets and reconstruct a fair ranking. Existing methods in fairness-aware ranking are mainly based on statistical parity, which cannot measure the true discriminatory effect since discrimination is causal. On the other hand, existing methods in causal-based anti-discrimination learning focus on classification problems and cannot be directly applied to ranked data. To address these limitations, we propose to map the rank position to a continuous score variable that represents the qualification of the candidates. Then, we build a causal graph that consists of both the discrete profile attributes and the continuous score. The path-specific effect technique is extended to the mixed-variable causal graph to identify both direct and indirect discrimination. The relationship between the path-specific effects for the ranked data and those for the binary decision is theoretically analyzed. Finally, algorithms for discovering and removing discrimination from a ranked dataset are developed. Experiments using a real dataset show the effectiveness of our approaches. Comment: 9 page

    A large scale hearing loss screen reveals an extensive unexplored genetic landscape for auditory dysfunction

    The developmental and physiological complexity of the auditory system is likely reflected in the underlying set of genes involved in auditory function. In humans, over 150 non-syndromic loci have been identified, and there are more than 400 human genetic syndromes with a hearing loss component. Over 100 non-syndromic hearing loss genes have been identified in mouse and human, but we remain ignorant of the full extent of the genetic landscape involved in auditory dysfunction. As part of the International Mouse Phenotyping Consortium, we undertook a hearing loss screen in a cohort of 3006 mouse knockout strains. In total, we identify 67 candidate hearing loss genes. We detect known hearing loss genes, but the vast majority of the candidate genes, 52, are novel. Our analysis reveals a large and unexplored genetic landscape involved with auditory function.

    First results from the AugerPrime Radio Detector


    Update of the Offline Framework for AugerPrime


    Combined fit to the spectrum and composition data measured by the Pierre Auger Observatory including magnetic horizon effects

    The measurements by the Pierre Auger Observatory of the energy spectrum and mass composition of cosmic rays can be interpreted assuming the presence of two extragalactic source populations, one dominating the flux at energies above a few EeV and the other below. To fit the data ignoring magnetic field effects, the high-energy population needs to accelerate a mixture of nuclei with very hard spectra, at odds with the approximate $E^{-2}$ shape expected from diffusive shock acceleration. The presence of turbulent extragalactic magnetic fields in the region between the closest sources and the Earth can significantly modify the observed CR spectrum with respect to that emitted by the sources, reducing the flux of low-rigidity particles that reach the Earth. We here take into account this magnetic horizon effect in the combined fit of the spectrum and shower depth distributions, exploring the possibility that a spectrum for the high-energy population sources with a shape closer to $E^{-2}$ be able to explain the observations.

    Event-by-event reconstruction of the shower maximum $X_{\mathrm{max}}$ with the Surface Detector of the Pierre Auger Observatory using deep learning


    Reconstruction of Events Recorded with the Water-Cherenkov and Scintillator Surface Detectors of the Pierre Auger Observatory


    Status and performance of the underground muon detector of the Pierre Auger Observatory


    The XY Scanner - A Versatile Method of the Absolute End-to-End Calibration of Fluorescence Detectors
